Performance Comparison of Neural Networks and GMM for Vocal/Nonvocal segmentation for Singer Identification

Authors

  • Ananya Bonjyotsna
  • Manabendra Bhuyan
Abstract

Vocal and nonvocal segmentation is an important task in singing voice signal processing. Before identifying the singer, it is necessary to locate the singer's voice in a song. Most songs start with a piece of instrumental accompaniment, known in musical terms as a 'prelude', after which the singing voice comes into play. It is therefore necessary to detect the vocal regions of the song in order to extract the singer's voice characteristics and to avoid the nonvocal regions, which contain the instrumental accompaniment. This work classifies vocal and nonvocal regions in songs using three different classifiers: Gaussian Mixture Model (GMM), Artificial Neural Network (ANN) with the Feed Forward Backpropagation (FFBP) algorithm, and Learning Vector Quantization (LVQ). Mel Frequency Cepstral Coefficients (MFCC) are used as the primary feature for classification. The same classification methods are applied to an existing database, MUSCONTENT, and to a newly created database, ASDB1, consisting of sixty excerpts from a wide variety of Assamese songs. The efficacy of the classifiers has been tested, and the results indicate that LVQ is a more robust classifier than FFBP and GMM.

Email: manab@tezu.ernet.in

Keywords: Music Information Retrieval (MIR), Singer Identification (SID), Gaussian Mixture Model (GMM), Artificial Neural Network (LVQ and FFBP), Mel Frequency Cepstral Coefficient (MFCC).
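To make the LVQ idea concrete, here is a minimal sketch of the basic LVQ1 update rule in plain Python. It is not the authors' implementation: the three-dimensional synthetic vectors merely stand in for MFCC frames, and every name (`make_frames`, `train_lvq1`, `predict`), the learning-rate schedule, and the cluster centers are assumptions chosen for illustration.

```python
import random

def nearest(protos, x):
    """Index of the prototype closest to x (squared Euclidean distance)."""
    return min(range(len(protos)),
               key=lambda i: sum((p - v) ** 2 for p, v in zip(protos[i][0], x)))

def train_lvq1(samples, labels, init_protos, lr=0.1, epochs=20, seed=0):
    """LVQ1: pull the winning prototype toward a same-class sample,
    push it away from a sample of the other class."""
    rng = random.Random(seed)
    protos = [(list(vec), lab) for vec, lab in init_protos]
    order = list(range(len(samples)))
    for epoch in range(epochs):
        rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
        rng.shuffle(order)
        for i in order:
            x, y = samples[i], labels[i]
            w, lab = protos[nearest(protos, x)]
            sign = 1.0 if lab == y else -1.0
            for d in range(len(w)):
                w[d] += sign * rate * (x[d] - w[d])
    return protos

def predict(protos, x):
    """Label of the nearest prototype."""
    return protos[nearest(protos, x)][1]

def make_frames(center, n, rng, spread=0.5):
    """Synthetic stand-in for feature frames: Gaussian noise around a center."""
    return [[rng.gauss(c, spread) for c in center] for _ in range(n)]

rng = random.Random(42)
vocal = make_frames([3.0, 3.0, 3.0], 100, rng)      # hypothetical "vocal" frames
nonvocal = make_frames([0.0, 0.0, 0.0], 100, rng)   # hypothetical "nonvocal" frames

train_x = vocal[:80] + nonvocal[:80]
train_y = ["vocal"] * 80 + ["nonvocal"] * 80
test_x = vocal[80:] + nonvocal[80:]
test_y = ["vocal"] * 20 + ["nonvocal"] * 20

# One prototype per class, initialized from a training sample of that class.
init = [(vocal[0], "vocal"), (nonvocal[0], "nonvocal")]
protos = train_lvq1(train_x, train_y, init)

accuracy = sum(predict(protos, x) == y for x, y in zip(test_x, test_y)) / len(test_y)
print(f"held-out accuracy: {accuracy:.2f}")
```

In a real pipeline the synthetic frames would be replaced by MFCC vectors computed per audio frame, and several prototypes per class would usually be needed, since vocal and accompaniment frames are far less cleanly separated than these toy clusters.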


Similar resources

A BIC-based approach to singer identification

A new singer identification system is presented in this thesis. The system is based on the idea of using only the vocal segments of a song to build the model of a particular singer. The most important contribution of the technique is the way these vocal segments are located. The borders between vocal and instrumental parts are first detected with the Bayesian Information Criterion (BIC), which i...


A multi-scale convolutional neural network for automatic cloud and cloud shadow detection from Gaofen-1 images

The reconstruction of the information contaminated by cloud and cloud shadow is an important step in pre-processing of high-resolution satellite images. The cloud and cloud shadow automatic segmentation could be the first step in the process of reconstructing the information contaminated by cloud and cloud shadow. This stage is a remarkable challenge due to the relatively inefficient performanc...


Handwritten Character Recognition using Modified Gradient Descent Technique of Neural Networks and Representation of Conjugate Descent for Training Patterns

The purpose of this study is to analyze the performance of the backpropagation algorithm with changing training patterns and a second momentum term in feed-forward neural networks. The analysis is conducted on 250 different words of three lowercase letters from the English alphabet. These words are presented to two vertical segmentation programs which are designed in MATLAB and based on portions (1...


Diagnosis of brain tumor using PNN neural networks

Cells grow and divide in an orderly way to create new cells that work properly and maintain the health of the body. When the ability to control cell growth is lost, cells divide without order, and the resulting cells form a tissue mass called a tumor. Brain tumors are, in fact, abnormal and uncontrolled cell proliferations. Segmentation methods are used in b...


An Automated MR Image Segmentation System Using Multi-layer Perceptron Neural Network

Background: Brain tissue segmentation for delineation of 3D anatomical structures from magnetic resonance (MR) images can be used for neuro-degenerative disorders, characterizing morphological differences between subjects based on volumetric analysis of gray matter (GM), white matter (WM) and cerebrospinal fluid (CSF), but only if the obtained segmentation results are correct. Due to image arti...



Publication date: 2014